Overview
Dataset statistics
| Number of variables | 19 |
|---|---|
| Number of observations | 4000 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 2.4 MiB |
| Average record size in memory | 622.2 B |
Variable types
| Text | 2 |
|---|---|
| Categorical | 8 |
| Numeric | 9 |
title has unique values | Unique |
text has unique values | Unique |
trust_score has 44 (1.1%) zeros | Zeros |
Reproduction
| Analysis started | 2026-01-15 12:17:12.275431 |
|---|---|
| Analysis finished | 2026-01-15 12:17:28.607815 |
| Duration | 16.33 seconds |
| Software version | ydata-profiling vv4.18.0 |
| Download configuration | config.json |
Variables
title
Text
Unique
| Distinct | 4000 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 292.0 KiB |
Length
| Max length | 18 |
|---|---|
| Median length | 18 |
| Mean length | 17.72325 |
| Min length | 15 |
Unique
| Unique | 4000 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Breaking News 1 |
|---|---|
| 2nd row | Breaking News 2 |
| 3rd row | Breaking News 3 |
| 4th row | Breaking News 4 |
| 5th row | Breaking News 5 |
| Value | Count | Frequency (%) |
| breaking | 4000 | |
| news | 4000 | |
| 3969 | 1 | < 0.1% |
| 3999 | 1 | < 0.1% |
| 3998 | 1 | < 0.1% |
| 3997 | 1 | < 0.1% |
| 3996 | 1 | < 0.1% |
| 3995 | 1 | < 0.1% |
| 3994 | 1 | < 0.1% |
| 3993 | 1 | < 0.1% |
| Other values (3992) | 3992 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 8000 | 11.3% |
| 8000 | 11.3% | |
| r | 4000 | 5.6% |
| a | 4000 | 5.6% |
| k | 4000 | 5.6% |
| B | 4000 | 5.6% |
| i | 4000 | 5.6% |
| n | 4000 | 5.6% |
| g | 4000 | 5.6% |
| N | 4000 | 5.6% |
| Other values (12) | 22893 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 70893 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 8000 | 11.3% |
| 8000 | 11.3% | |
| r | 4000 | 5.6% |
| a | 4000 | 5.6% |
| k | 4000 | 5.6% |
| B | 4000 | 5.6% |
| i | 4000 | 5.6% |
| n | 4000 | 5.6% |
| g | 4000 | 5.6% |
| N | 4000 | 5.6% |
| Other values (12) | 22893 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 70893 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 8000 | 11.3% |
| 8000 | 11.3% | |
| r | 4000 | 5.6% |
| a | 4000 | 5.6% |
| k | 4000 | 5.6% |
| B | 4000 | 5.6% |
| i | 4000 | 5.6% |
| n | 4000 | 5.6% |
| g | 4000 | 5.6% |
| N | 4000 | 5.6% |
| Other values (12) | 22893 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 70893 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 8000 | 11.3% |
| 8000 | 11.3% | |
| r | 4000 | 5.6% |
| a | 4000 | 5.6% |
| k | 4000 | 5.6% |
| B | 4000 | 5.6% |
| i | 4000 | 5.6% |
| n | 4000 | 5.6% |
| g | 4000 | 5.6% |
| N | 4000 | 5.6% |
| Other values (12) | 22893 |
text
Text
Unique
| Distinct | 4000 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 530.3 KiB |
Length
| Max length | 79 |
|---|---|
| Median length | 79 |
| Mean length | 78.72325 |
| Min length | 76 |
Unique
| Unique | 4000 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | This is the content of article 1. It contains detailed analysis and reports. |
|---|---|
| 2nd row | This is the content of article 2. It contains detailed analysis and reports. |
| 3rd row | This is the content of article 3. It contains detailed analysis and reports. |
| 4th row | This is the content of article 4. It contains detailed analysis and reports. |
| 5th row | This is the content of article 5. It contains detailed analysis and reports. |
| Value | Count | Frequency (%) |
| article | 4000 | 7.7% |
| detailed | 4000 | 7.7% |
| reports | 4000 | 7.7% |
| it | 4000 | 7.7% |
| this | 4000 | 7.7% |
| is | 4000 | 7.7% |
| the | 4000 | 7.7% |
| content | 4000 | 7.7% |
| of | 4000 | 7.7% |
| contains | 4000 | 7.7% |
| Other values (4002) | 12000 |
Most occurring characters
| Value | Count | Frequency (%) |
| 48000 | ||
| t | 32000 | |
| s | 24000 | 7.6% |
| e | 24000 | 7.6% |
| i | 24000 | 7.6% |
| a | 24000 | 7.6% |
| n | 24000 | 7.6% |
| o | 16000 | 5.1% |
| c | 12000 | 3.8% |
| r | 12000 | 3.8% |
| Other values (19) | 74893 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 314893 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 48000 | ||
| t | 32000 | |
| s | 24000 | 7.6% |
| e | 24000 | 7.6% |
| i | 24000 | 7.6% |
| a | 24000 | 7.6% |
| n | 24000 | 7.6% |
| o | 16000 | 5.1% |
| c | 12000 | 3.8% |
| r | 12000 | 3.8% |
| Other values (19) | 74893 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 314893 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 48000 | ||
| t | 32000 | |
| s | 24000 | 7.6% |
| e | 24000 | 7.6% |
| i | 24000 | 7.6% |
| a | 24000 | 7.6% |
| n | 24000 | 7.6% |
| o | 16000 | 5.1% |
| c | 12000 | 3.8% |
| r | 12000 | 3.8% |
| Other values (19) | 74893 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 314893 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 48000 | ||
| t | 32000 | |
| s | 24000 | 7.6% |
| e | 24000 | 7.6% |
| i | 24000 | 7.6% |
| a | 24000 | 7.6% |
| n | 24000 | 7.6% |
| o | 16000 | 5.1% |
| c | 12000 | 3.8% |
| r | 12000 | 3.8% |
| Other values (19) | 74893 |
state
Categorical
| Distinct | 20 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 256.5 KiB |
| Washington | 225 |
|---|---|
| California | 225 |
| Florida | 220 |
| Pennsylvania | 219 |
| Massachusetts | 215 |
| Other values (15) |
Length
| Max length | 14 |
|---|---|
| Median length | 12 |
| Mean length | 8.63825 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Tennessee |
|---|---|
| 2nd row | Wisconsin |
| 3rd row | Missouri |
| 4th row | North Carolina |
| 5th row | California |
Common Values
| Value | Count | Frequency (%) |
| Washington | 225 | 5.6% |
| California | 225 | 5.6% |
| Florida | 220 | 5.5% |
| Pennsylvania | 219 | 5.5% |
| Massachusetts | 215 | 5.4% |
| Indiana | 206 | 5.1% |
| Maryland | 203 | 5.1% |
| Wisconsin | 203 | 5.1% |
| Illinois | 199 | 5.0% |
| Ohio | 199 | 5.0% |
| Other values (10) | 1886 |
Length
| Value | Count | Frequency (%) |
| new | 387 | 8.5% |
| washington | 225 | 4.9% |
| california | 225 | 4.9% |
| florida | 220 | 4.8% |
| pennsylvania | 219 | 4.8% |
| massachusetts | 215 | 4.7% |
| indiana | 206 | 4.5% |
| maryland | 203 | 4.4% |
| wisconsin | 203 | 4.4% |
| illinois | 199 | 4.4% |
| Other values (12) | 2267 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 4193 | |
| a | 3888 | |
| n | 3689 | 10.7% |
| s | 3048 | 8.8% |
| o | 2370 | 6.9% |
| e | 2343 | 6.8% |
| r | 2130 | 6.2% |
| l | 1447 | 4.2% |
| h | 1019 | 2.9% |
| t | 837 | 2.4% |
| Other values (26) | 9589 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 34553 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| i | 4193 | |
| a | 3888 | |
| n | 3689 | 10.7% |
| s | 3048 | 8.8% |
| o | 2370 | 6.9% |
| e | 2343 | 6.8% |
| r | 2130 | 6.2% |
| l | 1447 | 4.2% |
| h | 1019 | 2.9% |
| t | 837 | 2.4% |
| Other values (26) | 9589 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 34553 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| i | 4193 | |
| a | 3888 | |
| n | 3689 | 10.7% |
| s | 3048 | 8.8% |
| o | 2370 | 6.9% |
| e | 2343 | 6.8% |
| r | 2130 | 6.2% |
| l | 1447 | 4.2% |
| h | 1019 | 2.9% |
| t | 837 | 2.4% |
| Other values (26) | 9589 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 34553 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| i | 4193 | |
| a | 3888 | |
| n | 3689 | 10.7% |
| s | 3048 | 8.8% |
| o | 2370 | 6.9% |
| e | 2343 | 6.8% |
| r | 2130 | 6.2% |
| l | 1447 | 4.2% |
| h | 1019 | 2.9% |
| t | 837 | 2.4% |
| Other values (26) | 9589 |
category
Categorical
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 255.8 KiB |
| Business | |
|---|---|
| Health | |
| Politics | |
| Sports | |
| Technology |
Length
| Max length | 13 |
|---|---|
| Median length | 10 |
| Mean length | 8.44125 |
| Min length | 6 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Entertainment |
|---|---|
| 2nd row | Technology |
| 3rd row | Sports |
| 4th row | Sports |
| 5th row | Technology |
Common Values
| Value | Count | Frequency (%) |
| Business | 724 | |
| Health | 695 | |
| Politics | 665 | |
| Sports | 644 | |
| Technology | 639 | |
| Entertainment | 633 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| business | 724 | |
| health | 695 | |
| politics | 665 | |
| sports | 644 | |
| technology | 639 | |
| entertainment | 633 |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 3903 | |
| s | 3481 | |
| e | 3324 | 9.8% |
| n | 3262 | 9.7% |
| i | 2687 | 8.0% |
| o | 2587 | 7.7% |
| l | 1999 | 5.9% |
| h | 1334 | 4.0% |
| a | 1328 | 3.9% |
| c | 1304 | 3.9% |
| Other values (12) | 8556 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 33765 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| t | 3903 | |
| s | 3481 | |
| e | 3324 | 9.8% |
| n | 3262 | 9.7% |
| i | 2687 | 8.0% |
| o | 2587 | 7.7% |
| l | 1999 | 5.9% |
| h | 1334 | 4.0% |
| a | 1328 | 3.9% |
| c | 1304 | 3.9% |
| Other values (12) | 8556 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 33765 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| t | 3903 | |
| s | 3481 | |
| e | 3324 | 9.8% |
| n | 3262 | 9.7% |
| i | 2687 | 8.0% |
| o | 2587 | 7.7% |
| l | 1999 | 5.9% |
| h | 1334 | 4.0% |
| a | 1328 | 3.9% |
| c | 1304 | 3.9% |
| Other values (12) | 8556 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 33765 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| t | 3903 | |
| s | 3481 | |
| e | 3324 | 9.8% |
| n | 3262 | 9.7% |
| i | 2687 | 8.0% |
| o | 2587 | 7.7% |
| l | 1999 | 5.9% |
| h | 1334 | 4.0% |
| a | 1328 | 3.9% |
| c | 1304 | 3.9% |
| Other values (12) | 8556 |
sentiment_score
Real number (ℝ)
| Distinct | 201 |
|---|---|
| Distinct (%) | 5.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -0.000645 |
| Minimum | -1 |
|---|---|
| Maximum | 1 |
| Zeros | 20 |
| Zeros (%) | 0.5% |
| Negative | 2005 |
| Negative (%) | 50.1% |
| Memory size | 31.4 KiB |
Quantile statistics
| Minimum | -1 |
|---|---|
| 5-th percentile | -0.9 |
| Q1 | -0.49 |
| median | -0.01 |
| Q3 | 0.51 |
| 95-th percentile | 0.89 |
| Maximum | 1 |
| Range | 2 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.5747681 |
|---|---|
| Coefficient of variation (CV) | -891.11334 |
| Kurtosis | -1.1930524 |
| Mean | -0.000645 |
| Median Absolute Deviation (MAD) | 0.5 |
| Skewness | -0.0043120571 |
| Sum | -2.58 |
| Variance | 0.33035837 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.64 | 34 | 0.9% |
| -0.31 | 30 | 0.8% |
| 0.12 | 30 | 0.8% |
| -0.01 | 30 | 0.8% |
| 0.25 | 29 | 0.7% |
| 0.67 | 29 | 0.7% |
| -0.84 | 28 | 0.7% |
| 0.87 | 28 | 0.7% |
| 0.13 | 28 | 0.7% |
| -0.2 | 28 | 0.7% |
| Other values (191) | 3706 |
| Value | Count | Frequency (%) |
| -1 | 11 | |
| -0.99 | 25 | |
| -0.98 | 18 | |
| -0.97 | 16 | |
| -0.96 | 23 | |
| -0.95 | 24 | |
| -0.94 | 19 | |
| -0.93 | 16 | |
| -0.92 | 21 | |
| -0.91 | 19 |
| Value | Count | Frequency (%) |
| 1 | 9 | 0.2% |
| 0.99 | 18 | |
| 0.98 | 17 | |
| 0.97 | 13 | |
| 0.96 | 19 | |
| 0.95 | 22 | |
| 0.94 | 25 | |
| 0.93 | 14 | |
| 0.92 | 12 | |
| 0.91 | 26 |
word_count
Real number (ℝ)
| Distinct | 1324 |
|---|---|
| Distinct (%) | 33.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 795.65575 |
| Minimum | 100 |
|---|---|
| Maximum | 1500 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 31.4 KiB |
Quantile statistics
| Minimum | 100 |
|---|---|
| 5-th percentile | 163 |
| Q1 | 445.75 |
| median | 793 |
| Q3 | 1150 |
| 95-th percentile | 1429 |
| Maximum | 1500 |
| Range | 1400 |
| Interquartile range (IQR) | 704.25 |
Descriptive statistics
| Standard deviation | 406.37387 |
|---|---|
| Coefficient of variation (CV) | 0.51074082 |
| Kurtosis | -1.2085476 |
| Mean | 795.65575 |
| Median Absolute Deviation (MAD) | 352 |
| Skewness | 0.0054378973 |
| Sum | 3182623 |
| Variance | 165139.72 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 104 | 10 | 0.2% |
| 930 | 10 | 0.2% |
| 549 | 9 | 0.2% |
| 315 | 9 | 0.2% |
| 1318 | 8 | 0.2% |
| 1199 | 8 | 0.2% |
| 480 | 8 | 0.2% |
| 878 | 8 | 0.2% |
| 1117 | 8 | 0.2% |
| 154 | 8 | 0.2% |
| Other values (1314) | 3914 |
| Value | Count | Frequency (%) |
| 100 | 4 | 0.1% |
| 101 | 2 | 0.1% |
| 102 | 2 | 0.1% |
| 103 | 3 | 0.1% |
| 104 | 10 | |
| 105 | 2 | 0.1% |
| 106 | 5 | |
| 107 | 2 | 0.1% |
| 108 | 2 | 0.1% |
| 109 | 2 | 0.1% |
| Value | Count | Frequency (%) |
| 1500 | 4 | |
| 1498 | 2 | 0.1% |
| 1497 | 4 | |
| 1496 | 6 | |
| 1495 | 2 | 0.1% |
| 1494 | 3 | |
| 1493 | 3 | |
| 1492 | 6 | |
| 1490 | 3 | |
| 1489 | 2 | 0.1% |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 1 |
| 3rd row | 0 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 2014 | |
| 1 | 1986 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 2014 | |
| 1 | 1986 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 2014 | |
| 1 | 1986 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 4000 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 2014 | |
| 1 | 1986 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 4000 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 2014 | |
| 1 | 1986 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 4000 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 2014 | |
| 1 | 1986 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 1 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 2062 | |
| 1 | 1938 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 2062 | |
| 1 | 1938 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 2062 | |
| 1 | 1938 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 4000 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 2062 | |
| 1 | 1938 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 4000 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 2062 | |
| 1 | 1938 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 4000 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 2062 | |
| 1 | 1938 |
readability_score
Real number (ℝ)
| Distinct | 2734 |
|---|---|
| Distinct (%) | 68.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 54.764595 |
| Minimum | 30.02 |
|---|---|
| Maximum | 79.98 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 31.4 KiB |
Quantile statistics
| Minimum | 30.02 |
|---|---|
| 5-th percentile | 32.8695 |
| Q1 | 42.48 |
| median | 54.235 |
| Q3 | 67.215 |
| 95-th percentile | 77.5005 |
| Maximum | 79.98 |
| Range | 49.96 |
| Interquartile range (IQR) | 24.735 |
Descriptive statistics
| Standard deviation | 14.404027 |
|---|---|
| Coefficient of variation (CV) | 0.26301713 |
| Kurtosis | -1.2047308 |
| Mean | 54.764595 |
| Median Absolute Deviation (MAD) | 12.395 |
| Skewness | 0.046218813 |
| Sum | 219058.38 |
| Variance | 207.47598 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 36.95 | 5 | 0.1% |
| 67.61 | 5 | 0.1% |
| 46.17 | 5 | 0.1% |
| 74.22 | 5 | 0.1% |
| 38.12 | 5 | 0.1% |
| 51.79 | 5 | 0.1% |
| 33.27 | 5 | 0.1% |
| 55.55 | 4 | 0.1% |
| 54.36 | 4 | 0.1% |
| 69.92 | 4 | 0.1% |
| Other values (2724) | 3953 |
| Value | Count | Frequency (%) |
| 30.02 | 1 | < 0.1% |
| 30.03 | 3 | |
| 30.04 | 2 | |
| 30.05 | 1 | < 0.1% |
| 30.09 | 1 | < 0.1% |
| 30.1 | 1 | < 0.1% |
| 30.11 | 1 | < 0.1% |
| 30.12 | 1 | < 0.1% |
| 30.13 | 1 | < 0.1% |
| 30.14 | 2 |
| Value | Count | Frequency (%) |
| 79.98 | 1 | < 0.1% |
| 79.97 | 2 | |
| 79.95 | 1 | < 0.1% |
| 79.93 | 4 | |
| 79.92 | 1 | < 0.1% |
| 79.91 | 2 | |
| 79.9 | 3 | |
| 79.89 | 1 | < 0.1% |
| 79.83 | 2 | |
| 79.8 | 1 | < 0.1% |
num_shares
Real number (ℝ)
| Distinct | 3849 |
|---|---|
| Distinct (%) | 96.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 25144.597 |
| Minimum | 39 |
|---|---|
| Maximum | 50000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 31.4 KiB |
Quantile statistics
| Minimum | 39 |
|---|---|
| 5-th percentile | 2504.8 |
| Q1 | 12781.75 |
| median | 25308.5 |
| Q3 | 37453.5 |
| 95-th percentile | 47642.15 |
| Maximum | 50000 |
| Range | 49961 |
| Interquartile range (IQR) | 24671.75 |
Descriptive statistics
| Standard deviation | 14387.537 |
|---|---|
| Coefficient of variation (CV) | 0.57219201 |
| Kurtosis | -1.1935124 |
| Mean | 25144.597 |
| Median Absolute Deviation (MAD) | 12319 |
| Skewness | -0.015970686 |
| Sum | 1.0057839 × 108 |
| Variance | 2.0700123 × 108 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1478 | 3 | 0.1% |
| 16019 | 3 | 0.1% |
| 28861 | 3 | 0.1% |
| 7886 | 3 | 0.1% |
| 36500 | 2 | 0.1% |
| 23784 | 2 | 0.1% |
| 20074 | 2 | 0.1% |
| 47161 | 2 | 0.1% |
| 35955 | 2 | 0.1% |
| 8461 | 2 | 0.1% |
| Other values (3839) | 3976 |
| Value | Count | Frequency (%) |
| 39 | 1 | |
| 45 | 1 | |
| 68 | 1 | |
| 84 | 1 | |
| 98 | 1 | |
| 135 | 1 | |
| 169 | 2 | |
| 175 | 1 | |
| 183 | 1 | |
| 187 | 1 |
| Value | Count | Frequency (%) |
| 50000 | 1 | |
| 49987 | 1 | |
| 49981 | 1 | |
| 49976 | 1 | |
| 49962 | 1 | |
| 49949 | 1 | |
| 49947 | 1 | |
| 49936 | 1 | |
| 49932 | 1 | |
| 49918 | 1 |
num_comments
Real number (ℝ)
| Distinct | 982 |
|---|---|
| Distinct (%) | 24.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 489.87025 |
| Minimum | 0 |
|---|---|
| Maximum | 1000 |
| Zeros | 5 |
| Zeros (%) | 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 31.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 49 |
| Q1 | 238 |
| median | 483 |
| Q3 | 741 |
| 95-th percentile | 947 |
| Maximum | 1000 |
| Range | 1000 |
| Interquartile range (IQR) | 503 |
Descriptive statistics
| Standard deviation | 287.43573 |
|---|---|
| Coefficient of variation (CV) | 0.58675891 |
| Kurtosis | -1.1930491 |
| Mean | 489.87025 |
| Median Absolute Deviation (MAD) | 250 |
| Skewness | 0.051458798 |
| Sum | 1959481 |
| Variance | 82619.301 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 633 | 12 | 0.3% |
| 349 | 11 | 0.3% |
| 526 | 11 | 0.3% |
| 169 | 11 | 0.3% |
| 926 | 10 | 0.2% |
| 319 | 10 | 0.2% |
| 807 | 10 | 0.2% |
| 859 | 10 | 0.2% |
| 33 | 10 | 0.2% |
| 387 | 10 | 0.2% |
| Other values (972) | 3895 |
| Value | Count | Frequency (%) |
| 0 | 5 | |
| 1 | 4 | |
| 2 | 6 | |
| 3 | 4 | |
| 4 | 7 | |
| 5 | 4 | |
| 6 | 7 | |
| 7 | 2 | 0.1% |
| 8 | 2 | 0.1% |
| 9 | 3 |
| Value | Count | Frequency (%) |
| 1000 | 5 | |
| 999 | 3 | 0.1% |
| 998 | 2 | 0.1% |
| 997 | 7 | |
| 996 | 5 | |
| 995 | 8 | |
| 994 | 2 | 0.1% |
| 993 | 3 | 0.1% |
| 992 | 3 | 0.1% |
| 991 | 5 |
political_bias
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 242.3 KiB |
| Left | |
|---|---|
| Center | |
| Right |
Length
| Max length | 6 |
|---|---|
| Median length | 5 |
| Mean length | 4.992 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Center |
|---|---|
| 2nd row | Left |
| 3rd row | Center |
| 4th row | Center |
| 5th row | Right |
Common Values
| Value | Count | Frequency (%) |
| Left | 1357 | |
| Center | 1325 | |
| Right | 1318 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| left | 1357 | |
| center | 1325 | |
| right | 1318 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 4007 | |
| t | 4000 | |
| L | 1357 | 6.8% |
| f | 1357 | 6.8% |
| C | 1325 | 6.6% |
| n | 1325 | 6.6% |
| r | 1325 | 6.6% |
| R | 1318 | 6.6% |
| i | 1318 | 6.6% |
| g | 1318 | 6.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 19968 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 4007 | |
| t | 4000 | |
| L | 1357 | 6.8% |
| f | 1357 | 6.8% |
| C | 1325 | 6.6% |
| n | 1325 | 6.6% |
| r | 1325 | 6.6% |
| R | 1318 | 6.6% |
| i | 1318 | 6.6% |
| g | 1318 | 6.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 19968 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 4007 | |
| t | 4000 | |
| L | 1357 | 6.8% |
| f | 1357 | 6.8% |
| C | 1325 | 6.6% |
| n | 1325 | 6.6% |
| r | 1325 | 6.6% |
| R | 1318 | 6.6% |
| i | 1318 | 6.6% |
| g | 1318 | 6.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 19968 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 4007 | |
| t | 4000 | |
| L | 1357 | 6.8% |
| f | 1357 | 6.8% |
| C | 1325 | 6.6% |
| n | 1325 | 6.6% |
| r | 1325 | 6.6% |
| R | 1318 | 6.6% |
| i | 1318 | 6.6% |
| g | 1318 | 6.6% |
fact_check_rating
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 241.1 KiB |
| Mixed | |
|---|---|
| FALSE | |
| TRUE |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 4.679 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | FALSE |
|---|---|
| 2nd row | Mixed |
| 3rd row | Mixed |
| 4th row | TRUE |
| 5th row | Mixed |
Common Values
| Value | Count | Frequency (%) |
| Mixed | 1372 | |
| FALSE | 1344 | |
| TRUE | 1284 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| mixed | 1372 | |
| false | 1344 | |
| true | 1284 |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 2628 | |
| i | 1372 | 7.3% |
| x | 1372 | 7.3% |
| e | 1372 | 7.3% |
| M | 1372 | 7.3% |
| d | 1372 | 7.3% |
| F | 1344 | 7.2% |
| L | 1344 | 7.2% |
| A | 1344 | 7.2% |
| S | 1344 | 7.2% |
| Other values (3) | 3852 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 18716 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| E | 2628 | |
| i | 1372 | 7.3% |
| x | 1372 | 7.3% |
| e | 1372 | 7.3% |
| M | 1372 | 7.3% |
| d | 1372 | 7.3% |
| F | 1344 | 7.2% |
| L | 1344 | 7.2% |
| A | 1344 | 7.2% |
| S | 1344 | 7.2% |
| Other values (3) | 3852 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 18716 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| E | 2628 | |
| i | 1372 | 7.3% |
| x | 1372 | 7.3% |
| e | 1372 | 7.3% |
| M | 1372 | 7.3% |
| d | 1372 | 7.3% |
| F | 1344 | 7.2% |
| L | 1344 | 7.2% |
| A | 1344 | 7.2% |
| S | 1344 | 7.2% |
| Other values (3) | 3852 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 18716 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| E | 2628 | |
| i | 1372 | 7.3% |
| x | 1372 | 7.3% |
| e | 1372 | 7.3% |
| M | 1372 | 7.3% |
| d | 1372 | 7.3% |
| F | 1344 | 7.2% |
| L | 1344 | 7.2% |
| A | 1344 | 7.2% |
| S | 1344 | 7.2% |
| Other values (3) | 3852 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 0 |
| 4th row | 1 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 2012 | |
| 1 | 1988 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 2012 | |
| 1 | 1988 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 2012 | |
| 1 | 1988 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 4000 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 2012 | |
| 1 | 1988 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 4000 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 2012 | |
| 1 | 1988 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 4000 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 2012 | |
| 1 | 1988 |
trust_score
Real number (ℝ)
Zeros
| Distinct | 101 |
|---|---|
| Distinct (%) | 2.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 49.96075 |
| Minimum | 0 |
|---|---|
| Maximum | 100 |
| Zeros | 44 |
| Zeros (%) | 1.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 31.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 4 |
| Q1 | 24 |
| median | 50 |
| Q3 | 76 |
| 95-th percentile | 96 |
| Maximum | 100 |
| Range | 100 |
| Interquartile range (IQR) | 52 |
Descriptive statistics
| Standard deviation | 29.467911 |
|---|---|
| Coefficient of variation (CV) | 0.58982124 |
| Kurtosis | -1.2203769 |
| Mean | 49.96075 |
| Median Absolute Deviation (MAD) | 26 |
| Skewness | 0.001527655 |
| Sum | 199843 |
| Variance | 868.3578 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 9 | 56 | 1.4% |
| 80 | 55 | 1.4% |
| 21 | 54 | 1.4% |
| 3 | 51 | 1.3% |
| 31 | 51 | 1.3% |
| 84 | 50 | 1.2% |
| 100 | 50 | 1.2% |
| 98 | 50 | 1.2% |
| 71 | 49 | 1.2% |
| 57 | 49 | 1.2% |
| Other values (91) | 3485 |
| Value | Count | Frequency (%) |
| 0 | 44 | |
| 1 | 33 | |
| 2 | 43 | |
| 3 | 51 | |
| 4 | 38 | |
| 5 | 47 | |
| 6 | 34 | |
| 7 | 48 | |
| 8 | 31 | |
| 9 | 56 |
| Value | Count | Frequency (%) |
| 100 | 50 | |
| 99 | 40 | |
| 98 | 50 | |
| 97 | 38 | |
| 96 | 40 | |
| 95 | 37 | |
| 94 | 33 | |
| 93 | 34 | |
| 92 | 41 | |
| 91 | 36 |
source_reputation
Real number (ℝ)
| Distinct | 10 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.54925 |
| Minimum | 1 |
|---|---|
| Maximum | 10 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 31.4 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 3 |
| median | 6 |
| Q3 | 8 |
| 95-th percentile | 10 |
| Maximum | 10 |
| Range | 9 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 2.8742198 |
|---|---|
| Coefficient of variation (CV) | 0.51794744 |
| Kurtosis | -1.2058065 |
| Mean | 5.54925 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | -0.041402403 |
| Sum | 22197 |
| Variance | 8.2611397 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 8 | 447 | |
| 1 | 425 | |
| 6 | 416 | |
| 10 | 413 | |
| 5 | 402 | |
| 7 | 393 | |
| 4 | 392 | |
| 3 | 385 | |
| 9 | 376 | |
| 2 | 351 |
| Value | Count | Frequency (%) |
| 1 | 425 | |
| 2 | 351 | |
| 3 | 385 | |
| 4 | 392 | |
| 5 | 402 | |
| 6 | 416 | |
| 7 | 393 | |
| 8 | 447 | |
| 9 | 376 | |
| 10 | 413 |
| Value | Count | Frequency (%) |
| 10 | 413 | |
| 9 | 376 | |
| 8 | 447 | |
| 7 | 393 | |
| 6 | 416 | |
| 5 | 402 | |
| 4 | 392 | |
| 3 | 385 | |
| 2 | 351 | |
| 1 | 425 |
clickbait_score
Real number (ℝ)
| Distinct | 101 |
|---|---|
| Distinct (%) | 2.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.4944475 |
| Minimum | 0 |
|---|---|
| Maximum | 1 |
| Zeros | 18 |
| Zeros (%) | 0.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 31.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.05 |
| Q1 | 0.24 |
| median | 0.49 |
| Q3 | 0.74 |
| 95-th percentile | 0.95 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range (IQR) | 0.5 |
Descriptive statistics
| Standard deviation | 0.2891375 |
|---|---|
| Coefficient of variation (CV) | 0.58476886 |
| Kurtosis | -1.2008695 |
| Mean | 0.4944475 |
| Median Absolute Deviation (MAD) | 0.25 |
| Skewness | 0.024136733 |
| Sum | 1977.79 |
| Variance | 0.083600495 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.47 | 52 | 1.3% |
| 0.67 | 51 | 1.3% |
| 0.17 | 49 | 1.2% |
| 0.24 | 49 | 1.2% |
| 0.45 | 49 | 1.2% |
| 0.02 | 48 | 1.2% |
| 0.14 | 48 | 1.2% |
| 0.86 | 48 | 1.2% |
| 0.59 | 48 | 1.2% |
| 0.85 | 48 | 1.2% |
| Other values (91) | 3510 |
| Value | Count | Frequency (%) |
| 0 | 18 | 0.4% |
| 0.01 | 42 | |
| 0.02 | 48 | |
| 0.03 | 40 | |
| 0.04 | 45 | |
| 0.05 | 39 | |
| 0.06 | 36 | |
| 0.07 | 43 | |
| 0.08 | 33 | |
| 0.09 | 47 |
| Value | Count | Frequency (%) |
| 1 | 21 | |
| 0.99 | 39 | |
| 0.98 | 44 | |
| 0.97 | 37 | |
| 0.96 | 35 | |
| 0.95 | 33 | |
| 0.94 | 36 | |
| 0.93 | 36 | |
| 0.92 | 42 | |
| 0.91 | 36 |
plagiarism_score
Real number (ℝ)
| Distinct | 3320 |
|---|---|
| Distinct (%) | 83.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 50.59811 |
| Minimum | 0.04 |
|---|---|
| Maximum | 99.95 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 31.4 KiB |
Quantile statistics
| Minimum | 0.04 |
|---|---|
| 5-th percentile | 4.67 |
| Q1 | 25.915 |
| median | 51.48 |
| Q3 | 75.58 |
| 95-th percentile | 95.181 |
| Maximum | 99.95 |
| Range | 99.91 |
| Interquartile range (IQR) | 49.665 |
Descriptive statistics
| Standard deviation | 28.932298 |
|---|---|
| Coefficient of variation (CV) | 0.5718059 |
| Kurtosis | -1.1826577 |
| Mean | 50.59811 |
| Median Absolute Deviation (MAD) | 24.79 |
| Skewness | -0.046223639 |
| Sum | 202392.44 |
| Variance | 837.07787 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 67.56 | 4 | 0.1% |
| 66.53 | 4 | 0.1% |
| 55.55 | 4 | 0.1% |
| 45.96 | 4 | 0.1% |
| 68.47 | 4 | 0.1% |
| 89.94 | 4 | 0.1% |
| 7.75 | 4 | 0.1% |
| 59 | 3 | 0.1% |
| 82.05 | 3 | 0.1% |
| 29.54 | 3 | 0.1% |
| Other values (3310) | 3963 |
| Value | Count | Frequency (%) |
| 0.04 | 1 | |
| 0.06 | 1 | |
| 0.09 | 1 | |
| 0.1 | 2 | |
| 0.13 | 1 | |
| 0.15 | 1 | |
| 0.18 | 1 | |
| 0.21 | 2 | |
| 0.22 | 1 | |
| 0.27 | 1 |
| Value | Count | Frequency (%) |
| 99.95 | 2 | |
| 99.88 | 1 | < 0.1% |
| 99.82 | 2 | |
| 99.81 | 1 | < 0.1% |
| 99.75 | 1 | < 0.1% |
| 99.71 | 1 | < 0.1% |
| 99.68 | 2 | |
| 99.65 | 3 | |
| 99.64 | 1 | < 0.1% |
| 99.61 | 1 | < 0.1% |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Fake |
|---|---|
| 2nd row | Fake |
| 3rd row | Fake |
| 4th row | Fake |
| 5th row | Real |
Common Values
| Value | Count | Frequency (%) |
| Fake | 2026 | |
| Real | 1974 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| fake | 2026 | |
| real | 1974 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 4000 | |
| e | 4000 | |
| F | 2026 | |
| k | 2026 | |
| R | 1974 | |
| l | 1974 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 16000 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| a | 4000 | |
| e | 4000 | |
| F | 2026 | |
| k | 2026 | |
| R | 1974 | |
| l | 1974 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 16000 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| a | 4000 | |
| e | 4000 | |
| F | 2026 | |
| k | 2026 | |
| R | 1974 | |
| l | 1974 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 16000 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| a | 4000 | |
| e | 4000 | |
| F | 2026 | |
| k | 2026 | |
| R | 1974 | |
| l | 1974 |
Interactions
Correlations
| category | clickbait_score | fact_check_rating | has_images | has_videos | is_satirical | label | num_comments | num_shares | plagiarism_score | political_bias | readability_score | sentiment_score | source_reputation | state | trust_score | word_count | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| category | 1.000 | 0.000 | 0.021 | 0.030 | 0.000 | 0.000 | 0.000 | 0.004 | 0.013 | 0.002 | 0.016 | 0.000 | 0.014 | 0.020 | 0.000 | 0.017 | 0.018 |
| clickbait_score | 0.000 | 1.000 | 0.000 | 0.000 | 0.007 | 0.017 | 0.050 | -0.002 | 0.002 | 0.015 | 0.000 | -0.011 | 0.014 | 0.026 | 0.027 | -0.013 | -0.035 |
| fact_check_rating | 0.021 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.015 | 0.000 | 0.000 | 0.017 | 0.000 | 0.000 | 0.032 | 0.000 | 0.023 | 0.000 |
| has_images | 0.030 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.004 | 0.034 | 0.000 | 0.000 | 0.008 | 0.028 | 0.000 |
| has_videos | 0.000 | 0.007 | 0.000 | 0.000 | 1.000 | 0.007 | 0.022 | 0.000 | 0.000 | 0.007 | 0.005 | 0.000 | 0.000 | 0.041 | 0.037 | 0.000 | 0.033 |
| is_satirical | 0.000 | 0.017 | 0.000 | 0.000 | 0.007 | 1.000 | 0.000 | 0.069 | 0.000 | 0.020 | 0.000 | 0.006 | 0.000 | 0.000 | 0.000 | 0.000 | 0.042 |
| label | 0.000 | 0.050 | 0.000 | 0.000 | 0.022 | 0.000 | 1.000 | 0.006 | 0.000 | 0.000 | 0.026 | 0.022 | 0.000 | 0.000 | 0.017 | 0.037 | 0.000 |
| num_comments | 0.004 | -0.002 | 0.015 | 0.000 | 0.000 | 0.069 | 0.006 | 1.000 | -0.003 | 0.025 | 0.000 | -0.026 | -0.022 | 0.007 | 0.000 | 0.000 | 0.002 |
| num_shares | 0.013 | 0.002 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | -0.003 | 1.000 | -0.006 | 0.033 | 0.008 | 0.029 | -0.018 | 0.000 | 0.013 | 0.002 |
| plagiarism_score | 0.002 | 0.015 | 0.000 | 0.000 | 0.007 | 0.020 | 0.000 | 0.025 | -0.006 | 1.000 | 0.038 | 0.002 | 0.000 | -0.035 | 0.012 | 0.000 | 0.031 |
| political_bias | 0.016 | 0.000 | 0.017 | 0.004 | 0.005 | 0.000 | 0.026 | 0.000 | 0.033 | 0.038 | 1.000 | 0.000 | 0.032 | 0.024 | 0.013 | 0.000 | 0.000 |
| readability_score | 0.000 | -0.011 | 0.000 | 0.034 | 0.000 | 0.006 | 0.022 | -0.026 | 0.008 | 0.002 | 0.000 | 1.000 | -0.002 | 0.015 | 0.000 | -0.020 | 0.013 |
| sentiment_score | 0.014 | 0.014 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | -0.022 | 0.029 | 0.000 | 0.032 | -0.002 | 1.000 | 0.010 | 0.000 | 0.003 | -0.004 |
| source_reputation | 0.020 | 0.026 | 0.032 | 0.000 | 0.041 | 0.000 | 0.000 | 0.007 | -0.018 | -0.035 | 0.024 | 0.015 | 0.010 | 1.000 | 0.026 | 0.004 | -0.009 |
| state | 0.000 | 0.027 | 0.000 | 0.008 | 0.037 | 0.000 | 0.017 | 0.000 | 0.000 | 0.012 | 0.013 | 0.000 | 0.000 | 0.026 | 1.000 | 0.000 | 0.000 |
| trust_score | 0.017 | -0.013 | 0.023 | 0.028 | 0.000 | 0.000 | 0.037 | 0.000 | 0.013 | 0.000 | 0.000 | -0.020 | 0.003 | 0.004 | 0.000 | 1.000 | 0.003 |
| word_count | 0.018 | -0.035 | 0.000 | 0.000 | 0.033 | 0.042 | 0.000 | 0.002 | 0.002 | 0.031 | 0.000 | 0.013 | -0.004 | -0.009 | 0.000 | 0.003 | 1.000 |
Missing values
Sample
| title | text | state | category | sentiment_score | word_count | has_images | has_videos | readability_score | num_shares | num_comments | political_bias | fact_check_rating | is_satirical | trust_score | source_reputation | clickbait_score | plagiarism_score | label | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | Breaking News 1 | This is the content of article 1. It contains detailed analysis and reports. | Tennessee | Entertainment | -0.22 | 1302 | 0 | 0 | 66.18 | 47305 | 450 | Center | FALSE | 1 | 76 | 6 | 0.84 | 53.35 | Fake |
| 1 | Breaking News 2 | This is the content of article 2. It contains detailed analysis and reports. | Wisconsin | Technology | 0.92 | 322 | 1 | 0 | 41.10 | 39804 | 530 | Left | Mixed | 1 | 1 | 5 | 0.85 | 28.28 | Fake |
| 2 | Breaking News 3 | This is the content of article 3. It contains detailed analysis and reports. | Missouri | Sports | 0.25 | 228 | 0 | 1 | 30.04 | 45860 | 763 | Center | Mixed | 0 | 57 | 1 | 0.72 | 0.38 | Fake |
| 3 | Breaking News 4 | This is the content of article 4. It contains detailed analysis and reports. | North Carolina | Sports | 0.94 | 155 | 1 | 0 | 75.16 | 34222 | 945 | Center | TRUE | 1 | 18 | 10 | 0.92 | 32.20 | Fake |
| 4 | Breaking News 5 | This is the content of article 5. It contains detailed analysis and reports. | California | Technology | -0.01 | 962 | 1 | 0 | 43.90 | 35934 | 433 | Right | Mixed | 0 | 95 | 6 | 0.66 | 77.70 | Real |
| 5 | Breaking News 6 | This is the content of article 6. It contains detailed analysis and reports. | North Carolina | Sports | 0.83 | 920 | 0 | 0 | 42.88 | 13148 | 28 | Right | FALSE | 0 | 8 | 1 | 0.01 | 72.10 | Fake |
| 6 | Breaking News 7 | This is the content of article 7. It contains detailed analysis and reports. | Maryland | Business | 0.81 | 651 | 0 | 1 | 62.39 | 13627 | 665 | Center | Mixed | 0 | 1 | 10 | 0.47 | 97.59 | Fake |
| 7 | Breaking News 8 | This is the content of article 8. It contains detailed analysis and reports. | Maryland | Politics | -0.96 | 717 | 1 | 0 | 75.07 | 6035 | 323 | Center | TRUE | 1 | 79 | 5 | 0.58 | 75.33 | Real |
| 8 | Breaking News 9 | This is the content of article 9. It contains detailed analysis and reports. | Tennessee | Politics | -0.64 | 1093 | 0 | 0 | 73.93 | 49000 | 881 | Center | TRUE | 1 | 96 | 7 | 0.08 | 39.37 | Fake |
| 9 | Breaking News 10 | This is the content of article 10. It contains detailed analysis and reports. | Maryland | Business | -0.50 | 1421 | 0 | 1 | 51.94 | 30508 | 782 | Center | FALSE | 1 | 88 | 3 | 0.68 | 99.12 | Real |
| title | text | state | category | sentiment_score | word_count | has_images | has_videos | readability_score | num_shares | num_comments | political_bias | fact_check_rating | is_satirical | trust_score | source_reputation | clickbait_score | plagiarism_score | label | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 3990 | Breaking News 3991 | This is the content of article 3991. It contains detailed analysis and reports. | Michigan | Entertainment | -0.41 | 235 | 1 | 1 | 35.34 | 25392 | 968 | Center | FALSE | 1 | 60 | 1 | 0.87 | 88.60 | Real |
| 3991 | Breaking News 3992 | This is the content of article 3992. It contains detailed analysis and reports. | Florida | Technology | 0.41 | 1190 | 1 | 1 | 34.52 | 23890 | 668 | Right | TRUE | 1 | 96 | 3 | 0.14 | 69.46 | Fake |
| 3992 | Breaking News 3993 | This is the content of article 3993. It contains detailed analysis and reports. | California | Sports | 0.20 | 1408 | 1 | 1 | 57.69 | 47924 | 96 | Center | FALSE | 1 | 72 | 2 | 0.65 | 73.27 | Real |
| 3993 | Breaking News 3994 | This is the content of article 3994. It contains detailed analysis and reports. | Texas | Technology | -0.45 | 1005 | 0 | 0 | 36.71 | 26287 | 592 | Left | FALSE | 0 | 7 | 8 | 0.98 | 76.95 | Fake |
| 3994 | Breaking News 3995 | This is the content of article 3995. It contains detailed analysis and reports. | Virginia | Sports | -0.26 | 405 | 1 | 0 | 42.32 | 4991 | 456 | Right | Mixed | 1 | 71 | 9 | 0.37 | 68.40 | Fake |
| 3995 | Breaking News 3996 | This is the content of article 3996. It contains detailed analysis and reports. | Ohio | Technology | 0.91 | 1227 | 1 | 1 | 67.32 | 38880 | 697 | Right | Mixed | 0 | 29 | 10 | 0.22 | 95.46 | Fake |
| 3996 | Breaking News 3997 | This is the content of article 3997. It contains detailed analysis and reports. | Washington | Sports | -0.57 | 1296 | 0 | 1 | 34.86 | 3650 | 925 | Left | FALSE | 1 | 53 | 3 | 0.42 | 16.54 | Fake |
| 3997 | Breaking News 3998 | This is the content of article 3998. It contains detailed analysis and reports. | California | Entertainment | -0.17 | 522 | 0 | 1 | 48.29 | 35391 | 577 | Left | FALSE | 0 | 22 | 9 | 0.50 | 28.51 | Fake |
| 3998 | Breaking News 3999 | This is the content of article 3999. It contains detailed analysis and reports. | Illinois | Health | -0.88 | 169 | 1 | 0 | 63.18 | 40424 | 201 | Left | FALSE | 1 | 3 | 6 | 0.17 | 71.16 | Real |
| 3999 | Breaking News 4000 | This is the content of article 4000. It contains detailed analysis and reports. | Texas | Health | -0.95 | 465 | 0 | 0 | 71.24 | 48913 | 279 | Right | TRUE | 1 | 73 | 4 | 0.09 | 27.65 | Real |